fix: honor Hygon DCU SLA topk environment setting by starrkk · Pull Request #1197 · ModelTC/LightX2V

starrkk · 2026-06-30T05:01:24Z

Summary

make the Hygon DCU sparse-attention fallback honor the same default top-k ratio used by the SLA path
keep the behavior controlled by SPARSE_ATTN_TOPK

Why

This fixes an inconsistency where the Hygon DCU fallback path used a different top-k default than the optimized runtime path.

Validation

branch rebuilt on latest ModelTC/LightX2V:main (89dfa833)
git diff --check passed for the PR branch

(cherry picked from commit e8ee93a79bd20dce2d084e992a8e140710f2c9b6)

gemini-code-assist

Code Review

This pull request updates the sparse attention configuration in flash_attn.py by changing the default fallback value of the SPARSE_ATTN_TOPK environment variable to 0.4 and dynamically passing topk_value instead of a hardcoded value. The review feedback recommends adding validation and error handling when parsing SPARSE_ATTN_TOPK to prevent potential runtime crashes from invalid float values or values outside the expected range.

Important

The consumer version of Gemini Code Assist on GitHub is being sunset. Starting June 18, 2026, new organization installations will be blocked, and all code review activity will officially cease on July 17, 2026.
For more details on the timeline and next steps, please review the Help Documentation.

gemini-code-assist · 2026-06-30T05:01:58Z

        # Use Flash Attention 2.6.1 (ROCm version) with varlen interface
        if SAPRDE_LINEAR_ATTN and int(os.getenv("USE_SLA", 0)) and q.shape[1] == k.shape[1]:
-            topk_value = float(os.getenv("SPARSE_ATTN_TOPK", "0.5"))
+            topk_value = float(os.getenv("SPARSE_ATTN_TOPK", "0.4"))


Parsing the environment variable SPARSE_ATTN_TOPK directly into a float without validation can lead to runtime crashes (if the value is not a valid float) or unexpected behavior/crashes in the underlying sparse attention kernel (if the value is outside the valid range of (0.0, 1.0]). It is safer to wrap this in a try-except block and validate that the resulting value is within the expected range.

try: topk_value = float(os.getenv("SPARSE_ATTN_TOPK", "0.4")) if not (0.0 < topk_value <= 1.0): raise ValueError except ValueError: logger.warning("Invalid SPARSE_ATTN_TOPK value. Falling back to default 0.4.") topk_value = 0.4

helloyongyang · 2026-07-01T04:56:15Z

We will check this pr.

And please pay attention to the code format:

pip install ruff pre-commit

pre-commit run --all-files

helloyongyang · 2026-07-01T04:59:15Z

Do not use environment variables to control the sparsity of attention, especially since this switch only works under hygons.

You can write two separate attention classes. See:

https://github.com/ModelTC/LightX2V/blob/main/lightx2v/common/ops/attn/flash_attn.py

https://github.com/ModelTC/LightX2V/blob/main/lightx2v/common/ops/attn/sla_attn.py

starrkk · 2026-07-01T08:38:40Z

@helloyongyang

Do not use environment variables to control the sparsity of attention, especially since this switch only works under hygons.

You can write two separate attention classes. See:

https://github.com/ModelTC/LightX2V/blob/main/lightx2v/common/ops/attn/flash_attn.py

https://github.com/ModelTC/LightX2V/blob/main/lightx2v/common/ops/attn/sla_attn.py

Thanks for the feedback. This PR mainly fixes an issue where topk does not take effect in the Hygon DCU SLA attention path. The current SLA interface uses a hardcoded topk value instead of the value from SPARSE_ATTN_TOPK.
I understand the concern about switching SLA inside flash attention via environment variables. If the preferred design is to keep flash_attn and sla_attn as separate implementations, I can update this PR and split the Hygon SLA attention into a separate class / registry entry.

fix: honor SLA topk environment setting

be16eac

(cherry picked from commit e8ee93a79bd20dce2d084e992a8e140710f2c9b6)

gemini-code-assist Bot reviewed Jun 30, 2026

View reviewed changes

starrkk marked this pull request as ready for review June 30, 2026 05:18

fix: validate Hygon SLA topk environment value

a89f1b7

style: format Hygon SLA topk changes

5999f50

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix: honor Hygon DCU SLA topk environment setting#1197

fix: honor Hygon DCU SLA topk environment setting#1197
starrkk wants to merge 3 commits into
ModelTC:mainfrom
starrkk:codex/hygon-sla-topk-env

starrkk commented Jun 30, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

gemini-code-assist Bot Jun 30, 2026

Uh oh!

helloyongyang commented Jul 1, 2026

Uh oh!

helloyongyang commented Jul 1, 2026

Uh oh!

starrkk commented Jul 1, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

starrkk commented Jun 30, 2026

Summary

Why

Validation

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot Jun 30, 2026

Choose a reason for hiding this comment

Uh oh!

helloyongyang commented Jul 1, 2026

Uh oh!

helloyongyang commented Jul 1, 2026

Uh oh!

starrkk commented Jul 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

starrkk commented Jul 1, 2026 •

edited

Loading